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1.  Introduction. 

The  purpose  of  this  paper  is  to  derive  asymptotically  efficient 
estimates  for  the  autoregressive  matrix  coefficients  and  moving  average 
covariance  matrices  of  the  vector  autoregressive  moving  average  (VARMA) 
models  in  both  time  and  frequency  domains.  To  do  this  we  shall  apply 
the  Newton-Raphson  and  scoring  methods  to  the  maximum  likelihood 
equations  derived  from  modified  likelihood  functions  under  the  Gaussian 
assumption.  ^ 

The  parameterization  in  this  paper  differs  from  that  of  other  works 
in  the  vector  case,  except  Ahrabi  (1978 ) which  deals  with  the  same 
estimation  problems  in  the  vector  moving  average  case,  and  it  follows 
that  of  Anderson  (1975 ),  Parzen  (1971),  and  Clevenson  (1970)  in  the 
scalar  case.  Hie  usual  parameterization  of  VARMA  models  is  in  terms 
of  the  autoregressive  and  moving  average  coefficients  and  the  covariance 
matrix  of  the  disturbance  vector  With  this  parameterization,  Hannan 

(196 9b, 1970)  has  considered  the  problem  in  the  pure  moving  average  case 
in  the  frequency  domain.  Nicholls  (1976)  has  extended  this  work  to  the 
estimation  of  full  VARMA  models  which  also  contain  exogenous  variables. 
Reinsel  (1976)  has  considered  the  problem  in  the  time  domain  and  has 
derived  estimates  using  the  Newton-Raphson  method  on  the  (modified) 
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maximum  likelihood  equations.  An  iterative  nonlinear  least  squares 
estimation  procedure  has  been  proposed  by  Tunnicliffe  Wilson  (1973  )• 

Other  papers  in  this  area  include  Akaike  (1973  ),  Kashyap  (1970), 

Whittle  (1963),  and  Osborn  (1977). 

As  indicated  above,  there  is  an  alternative  parameterization 
which  we  will  use  in  this  paper.  One  advantage  of  this  parameterization 
is  that  we  do  not  need  to  assume  that  some  past  values  of  e,  's  are 
fixed.  Also  as  Hannan  (1975)  has  pointed  out  it  is  easy  to  recover  the 
original  parameters  using  the  spectral  density. 

Newton  (1975)  considers,  among  other  things,  the  estimation  of 
moving  average  covariance  matrices  in  the  pure  moving  average  case  in 
the  frequency  domain.  But  his  method  is  different  from  the  methods 
used  in  this  paper.  He  regresses  the  elements  of  the  sample  spectral 
density,  evaluated  at  some  equidistant  points,  on  certain  trigono- 
metric functions  using  the  method  of  weighted  least  squares. 

To  summarize.  Chapter  2 describes  the  model  and  the  parameters  to 
be  estimated.  Chapter  3 deals  with  the  estimation  problem  in  the  time 
domain.  The  modified  likelihood  function  is  derived  under  the  assumption 
of  normality,  using  the  method  developed  in  Anderson  (1975).  Then  the 
Newton-Raphson  and  scoring  methods  are  applied  to  the  resulting  maximum 
likelihood  equations.  The  chapter  closes  with  remarks  about  the  computa- 
tional problems.  The  estimation  in  the  frequency  domain  is  discussed  in 
Chapter  4.  The  modified  likelihood  function  used  is  similar  to  that  of 
Whittle  (1953*1961)  and  Dunsmuir  and  Hannan  (19^6).  Again  the  Newton- 
Raphson  and  scoring  methods  are  applied  to  the  maximum  likelihood  equations. 
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Hie  asymptotic  properties  are  discussed  in  Chapter  5.  Hie  estimates 
are  shown  to  be  asymptotically  efficient  under  suitable  assumptions. 


In  Chapter  6 we  return  to  the  usual  parameterization  and  derive 
estimates  for  the  autoregressive  and  moving  average  matrix  coefficients 
and  the  covariance  matrix  of  € , using  the  scoring  method  in  the  time 
domain.  Finally  in  the  Appendix  we  present  some  of  the  mathematical 
results  used  in  previous  chapters. 
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2.  The  Model. 


We  have  observations,  y2,  j^,  on  the  process  {^t) 


generated  by 


(2.1) 


£ = i ~j~t-j  - V * - °»  -1’  •"» 


where  j^'s  and  e^'s  are  m x 1 vectors  and  B^'s  and  A.'  s are 
m x m matrices  and  B^  = A^  = 1^.  Let 


(2.2) 


- I 


Si*1  - 


(2.3) 


. i a. 


Assumption  1.  The  £ ' s axe  independently  identically  distributed 
random  vectors  with  mean  zero  and  unknown  covariance  matrix  V. 

Assumption  2.  The  zeros  of  |b(z)|  lie  outside  the  unit  circle. 
Assumption  3.  The  zeros  of  |A(z)|  lie  outside  the  unit  circle. 
Assumption  4.  A greatest  common  left  divisor  of  A(z)  and  B(z) 


is  3L* 
— mu 


Assumption  3.  The  matrix  (Bp,  A^ ) is  of  full  rank,  i.e.. 


r (B.  A ) = m. 
Remarks. 


(i ) Assumption  2 ensures  the  stationarity  of  the  process.  It  also 
makes  ^ independent  of  £t+2>  •••  • 


(ii ) Assumptions  3,  4,  5 ensure  that  the  system  is  identified.  By 
this  we  mean  that  the  autocovariances  of  the  process  defined  by 


(2.4) 


eMt+s  ' = = 0,  +1, 


determine  a(z)  and  b(z)  uniquely.  (See  Hannan  (1969a)).  In  particular 
we  can  get  the  moving  average  matrix  coefficients  from  the  moving  average 
covariance  matrices  defined  by 


(2.5) 


£ = 6 Ht~t+s  ' s = °>  •••»  * > 


uniquely,  by  solving  the  following  system  of  equations 


( s ) 

£ -jl  ■ s ~0’  ^ 


Finally,  assumption  5 is  not  a necessary  condition  and  it  can  be  replaced 
by  other  conditions,  see  Hannan  (1971)  and  Kashyap  and  Nasburg  (1974). 
The  parameters  of  interest  are 


£<S>  -e2tHU  ’ s * °'1'  •••’  1 ’ 


B,  r - 1, . . . ,p 
~r 


Since  we  will  differentiate  the  log  likelihood  function  of  (jr^,  ...»  ) 

with  respect  to  the  elements  of  the  above  matrix  parameters,  it  is  more 
convenient  to  vectorize  them. 
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Definition.  If  C = (c„ , , c_),  where  c4 ' s are  column 

vectors. 


vec  C = 


~1 


'£n 


Before  we  proceed  with  vectorization  of  the  matrix  parameters,  we 


.(0) 


notice  that  Zv J is  symmetric  and  hence  should  be  treated  differently 


(s ) 

from  Z'  , s = 1, 


.(0) 


, q.  In  vectorizing  Z v~/  we  only  need  to 
vectorize  the  diagonal  and  lower  diagonal  elements.  So  we  let 


So 


(1) 


dg  Z 


(0)  . 


mm 


= vec  , 


where  vec  is  an  operator  that  vectorizes  the  elements  of  the  matrix 
that  it  is  applied  to,  ignoring  the  diagonal  and  upper  diagonal  elements, 
e.g.. 


vec 


1 

3 

f 0 

0 

5 

2 

4 

w 

8 

i ! 

i 8 

So  the  parameters  are 


*1 


I 


Finally  we  can  state  that  the  parameters  of  interest  are  the  components  of 
p where 


(2.6)  £<  = (£’,£•)  = (£',  g’,  e^,  £',  01) 


We  shall  also  find  it  useful  to  introduce  another  vector,  6 vAiere 


(2.7) 


£•  = (vec’  T^°\  vec’  ...,  vec'  £^) 


- (&  **•»  £q^  * 


Remarks . 


(i ) We  can  find  a matrix  £ such  that  for  any  m x m matrix 


(2.8) 


dg(A)  = C vec  A 


2 2 

It  is  easy  to  see  that  C is  obtained  from  the  m x m identity  matrix 
by  deleting  all  the  rows  except  the  1st,  m+2nd,  2m+3rd,  ...,  m2th,  i.e., 


e 1 


where 
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m m 


(ii ) Similarly  we  can  find  an  m ^ x m2  matrix  D such  that 
for  any  m x m matrix  A 

(2.9)  vec  A = D vec  A . 

! 

It  is  easily  verified  that  D is  obtained  from  I 2 by  deleting  the 

~m 

following  rows 


m+1 , 2m+l t ii<| 

(m-1  )m+l 

m+2 t 2m+2 } . . ■ f 

(m-1  )m+2 

2m+3 , . . . , 

(m-1  )m+3 

(m-1  )m+m 

3.  Estimation  in  the  Time  Domain. 


3.1.  The  Likelihood  Function. 

We  axe  concerned  with  maximum  likelihood  estimation,  so  the  first 

step  is  to  find  the  likelihood  function.  To  find  this  we  need  to  know 

the  distribution  of  But  in  the  previous  chapter  we  assumed  that 

e^'s  are  i.i.d. , without  assuming  any  particular  distribution.  However 

in  deriving  the  likelihood  function  we  shall  treat  e 's  as  normal 

'"'t 

vectors  and  later  we  shall  demonstrate  that  the  resulting  estimates  have 
the  same  asymptotic  covariance  matrix  irrespective  of  the  distribution 
of  e^'s  as  long  as  the  assumptions  in  Chapter  2 are  satisfied. 

Even  with  the  assumption  of  normality  we  cannot  find  the  exact 
likelihood  function  except  in  the  pure  moving  average  case.  In  order  to 
find  an  approximate  likelihood  function,  following  Anderson  (1975),  we 
assume 

lo  = 3Li  = = JCi-p  = 2 • 

The  likelihood  function  we  will  derive  is  in  fact  the  conditional  likeli- 
hood of  6iven  ^at  Xo>X-L>* " ' 'X/L-p  8X6  equal  to  expected 

values . Now 

Xt  + 2iXt-i  + •••  + 2pXt-p  = St  • 

Transposing  both  sides  yields 

3Ct  + XU21  + • ’ • + lt-^p  = St  • 

Writing  these  equations  for  t = 1,2,  ...,  T,  we  get 
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(3.1.2) 


l 

i=0 


L YB.  = U , 


vhere 


X'  = (Xi»  •••>  XT)  > ST  * (V  •••'  V * 

Now  we  shall  need  the  following  lemma  in  vectorizing  (3.1.2). 
Lemma  3.1. 

vec  ABC  = (C*  ® A)vec  B . 

w ~ ^ 

See  MLnc  and  Marcus  (I96U-  )- 

Using  this  lemma  on  the  left  hand  side  of  (3.1.2)  we  get 


(3.1.3) 


Now  let 


£ (B,  <2>  L1  )vec  Y = vec  U . 


i=0 


* = £ (Si<2>  L1 ) , JC  = vec  Y , u = vec  U , 


i=0 


then  (3.1.3)  can  be  written  as 


Because  of  the  Gaussian  assumption,  to  find  the  likelihood  function 
(the  density  of  j£)  we  only  need  to  find  the  covariance  matrix  of  u. 
Now,  it  follows  from  the  normality  of  e.'s  that 


(3.1.5) 


where 


/ ~L  \ 

'V 


N(0,  Z)  , 


(0) 


+ (L1  <2>  ^ +L©I^^)  + ..*  + (L,q®£^  + Lq®  Z^'  ) 

**  *+*  v **  'iv  ^ mm' 


The  following  lemma  will  enable  us  to  derive  the  distribution  of  u 
from  (3.1.5). 
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Lemma  3.2.  If  A is  any  r x s matrix. 


vec  A'  = K vec  A , 
~ ~r.s  ~ 


vfcere  g is  a square  rs  x rs  matrix  partitioned  into  r x s 


submatrices  such  that  the  i.i^  block  has  a 1 in  the  .ii^1  position 


f 


This  means  that 


b 

~r 


= vec  A’ 


Q.E.D. 


Note.  We  shall  use  the  convention 


K = K M . 
~n  ~n,n 


Now  we  use  the  above  lemma  to  derive  the  distribution  of  u.  By 
definition 


u = vec  U = vecfuj,  ..., 
Now,  using  lenma  3.2  we  have 


vec (~i»  »r)  = jjj) 


~ta,T 

This,  together  with  (3.1. 5),  yields 

Jd  ~ N(o,  r ) , 


ill 

Sr 


where 
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J 


£ “ £m,T  ~ £ta,T  * 


Lemma  3«3. 


(i)  If  A and  B are  m x n and  r x s matrices  respectively. 


K (A®  B)K  = B®  A . 
~m,r  ~ ~ ~s,n  ~ ~ 


(ii)  5r>s  = &,T 


See  MacRae  (1974). 


Using  this  lemma  and  noticing  that  Z is  given  by  (3.1.6)  we  get 
(3.1.7)  £ = Z(0)®  Jp  + +Z(l)'®L)  + ...  + (E(q)®  L,q  + Z(qV  ® L(q))  . 


We  shall  be  using  these  lemmas  in  the  later  chapters  as  well. 
Finally  from  (3.1.3)  we  get 


X~N(o,  8_1  r#'-1)  . 


This  gives  us  the  (modified)  log  likelihood  function 


(3.1.8)  log  t (X)  = - | s’  r'1*^  + \ loglr'1!  - ^ log 


since  Isl  = 1,  as  we  shall  see  in  (3-5. 2).  The  maximum  likelihood 
estimates  are  a set  of  roots  of 
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d log  i (^) 

= o . 

So  we  proceed  to  derive  the  first  derivative  of  log  £(jr). 

3.2.  The  First  Derivative  of  log  l (^). 

In  differentiating  the  log  likelihood  function  with  respect  to  £ 
we  only  need  to  differentiate  the  first  term  and  by  using 

g(i^S_L)=  o 
3 e 


complete  the  derivative.  Hiat  is 


(3.2.1) 


d log  i 1 

■ire  = ' 2 


Ste’*'!:'1*  ti>  jr) 

si  e W~ 


We  shall  also  use  the  following  lemma 

Lemma  3.U.  For  any  two  column  vectors  x and  z 


vec  (x  z ’ ) = z <E>  x * 


Proof.  It  is  easily  verified  by  writing  out  the  two  sides. 

The  Derivative  With  Respect  to  Autoregressive  Coefficients. 

Let  bi^  denote  the  i,:jth  element  of  13  then  from  (3.1.3) 
i.i 


(3.2.2) 


5b. 


= E . . <2>  L , 
~ij  ~ 


= 1, 


m 


id 


Using  this  we  get 
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(5.2.3) 


^rs'L'1 


Now  using  lemma  3.1 


— - I’S'E"1^  ® hrk  * £ %i®  t 'lE"1*  l 

* 2r  a,£'1QSu®srfe  • 

we  have 


= vec^'l'r-1^®  L1*^] 

= (X'®K,S,r-1)vec(EiJ®  Lr)  . 


This  together  with  (3.2.3),  yields 


dfr'S’r"1*  y)  . 

(5.2.4)  ~sV  = 2(r®r®’£  4 > 

~r 


where 


Which  in  turn  yields 

d(y 'B'r-1®  y) 

/T  ^ r*  \ **  ~ ~ K,  ' — I . 

(5.2.5)  5^5 = 2(X'®£’B'r  )E  , 

where 

(5.2.6)  E = (En,  ...,  E ) . 

~ 7 ~p' 


Finally  using  (3.1.8)  we  get 
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(3.2.7) 


E"1**.*  • 

*0 


The  Derivative  with  Respect  to  Moving  Average  Covariances. 

As  indicated  by  (3.2.1)  we  only  need  to  find 

d y1  BT-1®  v 

IV/  IV  (V  IV  Iv 

Te • 

IV 

However  it  is  more  convenient  to  find 


3 e 


which  is  related  to  the  former  derivative.  To  find  the  relationship  we 
note  that 


dr 


da. 


id 


dr 

T7o7, 

\doji  / 


where  and  are  treated  as  different  variables.  This  means 


dfc'jri:'1*  sc) 


da. 


w 

ij 


da 


Jo T 


Ji 


which  in  turn  yields 


(3.2.8) 


d ic'JX"1-®  l 

Saij 


= 2 157 


da. 


id 


.(0) 


where  " - indicates  that  we  take  the  symmetry  of  Ev into  account. 
In  view  of  (2.8)  and  (2.9)  using  (3.2.8)  we  get 
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(3.2.9) 


dX,J8,£"1®X  ^ JC' »'  £_1S  JC 

= G 


2 in  (m+1 ) 2 

where  G is  a [qm  + — K— — -]  x (q+l)m  matrix  which  can  be  written 


„ m(m-l ) 

S2  2 » 

„ I 2 

2 3/  ^ 


G1  = (C,  0),  a = (2D,  0),  G,  = (0,  I) 


with  C and  D as  in  Chapter  2.  It  is  obvious  that  from  (3.2.9)  we 
can  conclude 


(3.2.10! 


5 log  l _ „ 3 log  i 

~ ~ ae 


We  now  proceed  to  derive  — ^ . Using  lemma  3.1  we  have 


(3.2.11) 


a y*  S'  r'1®  y a vec(y'  S'  r'1®  y ) 


a vec  r" 


(^'®'  ® 


, br  1 

= -(x,®,©i:,®,  )yec^r  -X.  £ ) 

ae 


■te'S'  <2>  X'®’  )(£‘1<2>£’1) 


a vec  r 


■(K’S'^OK’S’r1) 


a vec  r 
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a vec  r 


To  find  we  need  to  find  — 7 — r , for  s=  0,1,..., q and 

ae'  ao, . 

~ ij 


i,j  =l,...,m.  From  (3.1.6)  we  get 


(3.2.12) 


T75T  ~ Sij®^T 

J-j 


(3'2,13)  Trir  ~ ~ S + ~ ' s - 1,  ...,  q,  i,j  - 1,  ...,  m, 

dCTij 

Where  is  an  m x m matrix  with  one  in  the  ijth  position  and  zeros 

elsewhere.  Now,  vectorizing  (3.2.12)  yields 


a vec  r 


r = vec(£ij®  iq.)  = say, 


which  yields 


a vec  f 


(3  2 14)  U ~ r(0)  a(0)  a (° ) ) _ p sav 

U ^ - l«11  , “21  ' Xmm  )~Zo’  Say 


Similarly 


(3.2.15) 


a vec  r 

= F 

^ ~s 

a0c 

~S 


(a (s) 


. , a^)  , 

7 ~mm  ' J 


where 


~ij  = Ve^Sij®i's  + Sji ®£S]  > 5 = • 


So  the  derivative  with  respect  to  6'  is 
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(3-2.16) 


vec  r 

— = (&'  El'  V = l > say  • 

30 


Now  using  (3.2.11)  we  have 


(3-2.17) 


3 y'$'r  ® y .1  -1 

* ~ — = -P'  (r  9 i<2>  £ 1) 


3 0 


To 


complete  the  derivative  of  5 ^ ^ need  to  find  the  expected 


3e 


value  of  (3  • 2 . l6 ) 


e 


3 ®'  £ -1®  x,\ 


6 £ 


= -f'  e vec(£~1®x*C'S'£~"L^ 

= vecCr"1*®-^  ®'  ‘Vr'1) 


.-1. 


-F’  Vec  T 


-1 


Using  (3-2.1)  we  get 


(3-2.18) 


d lQ6  ^ = - f'  (r-1B  x,®  r-1®  Vec  r' 

30  2~~~ 


-ll 


Finally  vising  (3.2.10),  we  have 


(3.2.19) 


d 1o6-1.  = i-  g F'  [r-1fi  £<2>  r-1®  vec  r' 

3e 


-l, 


3.3.  The  Numerical  Approximations. 
The  equation 

3 log  l (jj) 


d £ 


= 0 


is  nonlinear  and  cannot  be  solved  explicitly.  Therefore  we  will  use 
numerical  approximations  that  yield  asymptotically  efficient  estimates. 
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— 


These  methods  are  the  Newton-Raphson  and  Scoring  methods.  Both  methods 
require  that  we  start  with  an  initial  estimate  that  is  consistent  of 


we  can  replace  the  autocovariances  i.e. , 


V ' s by  their  sample  analogues, 

C^'s,  and  then  solve  the  resulting  equations  to  obtain  initial  estimates 

for  B 's.  So  the  equations  are 
~r 


(3.3.3) 


where 


(3.3-4) 


l 

r=l 


B (0)C  = -C 


~r '' '~r-s  -£-s  ’ s = q+1,  ...,  q+p 


T-k 


£k  = | JCtxUk  = s:k  • 


Having  obtained  these  estimates  we  can  form 


Ht<0)  * r|0!r(0>Xt.r  ’ ‘ -1'  -•>  T 


Bear  in  mind  that  1^(0)  = I and  ^ = • • • = = 0.  Now  we  can  use 

A , 

lit* 


the  same  autocovariances  of  5 (o)'s  to  estimate  s = 0,1,  ...,  q. 


We  estimate  by 


(3.3-5)  i £ u (0)u!  (0)  - , s 0,1,  ..., 


-(0) 


T ~tv  ~t+s 


(0) 


Finally  by  vectorizing  the  initial  estimates  obtained  in  this  manner  we 
get  an  initial  estimate  of  p,  which  we  shall  denote  by  P(q)* 

Note.  Hie  initial  estimate  £(o)  may  no^  satisfy  the  conditions  for 
a moving  average  covariance  matrix.  For  example  when  m=q=l,  p=0 
the  constraint  on  £ is 

2 

!l  < 1 

— < IT  • 

ao 
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The  Newton-Raphson  Method. 


This  method  consists  of  solving  the  following  system  of  linear 


equations  for  p. 


3 log  l 


^^£4(0) 


(£(i)-e(o))  * 


•£-£(o) 


So  we  need  to  find  the  second  partial  derivatives  of  log  l. 


d2log  l 


Derivation  of  ^ ,■  . 


As  in  (3.2.7) 


Differentiating  this  with  respect  to  b;  / yields 

* J 


d2log  l 


ae  ab,vi. 

~ ij 


FT  ■ -£■  tK®  £_1  CEij  <S>  ir>K] 


Now  using  lemma  5*k  we  can  rewrite  the  term  inside  the  brackets  as 


Vect~’1(Sij®ir)XX,5  = ^r^jvecf^j®  Lr) 


where  we  have  used  lemma  3.1.  So  finally  we  have 


(3.3.6) 


fey..  -E'[IX'®r1m 


I I 


2k 


Derivation  of 


52log  l 

'Site'-  * 


.(s) 


Differentiating  (3.2.7)  with  respect  to  ' we  get 


d2log  l 


= E'[x®C"1(-T?r)r1»):] 

s<,iJ 


Now  using  lemma  3.^  the  right  hand  side  can  be  rewritten  as 


dr 


S'  vec[£’1  -7rr— ^,]  » 

da.  . 

10 


which  in  turn  can  be  rewritten  using  lemma  3.1  as 


. ” 1 iPx  n”l  ’ 


d vec  r 


This  means 


I'kti'i  ®£  ) — jrr  • 


>2 d vec  r 

-1  6 £ = E'  (vy1  ST'1®  r'1)  - 

de  de*  ~ ~ de 


But  (3.2.15)  states 


d vec  f 


d©' 


= F . 


So 


dP  de' 
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; 


Finally  we  get 


(3.3.7)  - &SLL  o'  . j.  (jcx'J'i'-1®  [‘‘iPJ'  . 

dP  de’  dp  de1 

MM  MM 

2 

Derivation  of  - . 

de  de* 

M M 

As  in  (3.2.17)  we  have 


djog-i.  = i.  F<  [r-1B  y®  r"1®  y - vec  r-1]  . 

M M M M MMX#  M 

ae 


(s  ) 

Differentiating  this  with  respect  to  ay . we  get 


dr 


(3.3.8)  d log  < = l F1  [ -r"1 — r^sy  ® r_1»y 


de  dCT. 


ij 


H 

1 , dr  . dr  . 

£"  sx®l  — Sr Si* vec-"  arr  f 1 

H *13 


Now,  using  lemma  3.^  the  right  hand  side  of  (3.3.8)  can  be  rewritten  as 


\ r ''“i-£'1Sii,s,£'1  rrrrE'^ii'rr1 

Hi  H 

-1  ^ ~ -1 
*r.  ~~jsj  £ ^ * 


dr 


-1  „-i  „-i. 


dcr. 


ij 


And  using  lemma  3.1  this  can  be  rewritten  as 
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(3.3.13) 


E'  0 


0 GF' 


E 0 


0 FG 


J (fi(i)-£(o))  = 5(0)  » 


where  £ = which  is  given  by  (3-2.7)  and  (3.2.18).  Once  we  have 

we  could  carry  out  a second  iteration  by  replacing  with 

£(2)’  £(0)  With  £(l)'  2(o)  with  2(i)  611(1  &(o)  with  ~(l)  in  (3.3.13) 
and  solve  for  £(2)*  But  even  f°r  samples  of  moderate  size  this  would 

be  computationally  very  costly. 

The  Scoring  Method. 

The  equation  for  this  method  is  given  by  (3.3.2).  We  notice  that 


we  have  to  find 


Taking  expectations  of  both  sides  of  (3.3*11 ) we  get 


(3.3. 1U) 


Now  recall  that 


(SplofiJ.)  = 


E'  0 E 0 

**  ***  ^ _ **  *** 

e n 

0 GF'/  ~ ,0  FG' 


e(ZJC' ) ~®"L£  S'"1 


Applying  this  to  (3-3.9)  we  get 


e(M)  = r"1®  r_1 


We  can  also  easily  find  the  expectation  of  the  other  entries  of  n.  The 
end  result  is 
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e it  = 


8"1rs"1©r‘1 


s’’1®  r"1 


a"1©  r-1 


= A > say. 


Substituting  this  in  (3.3.14)  we  get 


(5.3.15) 


e( 


a>2iog  i 
dp  dp' 


r E'  0 ) 

! E o' 

A 

, 0 GF'| 

( 0 FGV 

1 w » 

So  finally  the  equations  for  the  Scoring  method  are 


(3.3.16) 


/ E*  0 \ 

A 

£ 

0 GF’ I 

, <V  <v»v  / 

~(o ) 

12 

EG' 

/ 

(£(1)^(0))  = &(0)  • 


3.4.  The  Pore  Moving  Average  Case. 

The  pure  moving  average  case  corresponds  to  the  model  defined  by 
(2.1)  with  e = 0.  This  case  was  treated  by  Ahrabi  (1978).  But  we 
can  also  derive  the  estimation  equations  from  (3.3.13)  and  (3-3.16) 
by  letting 


£ = 0 > £ = £ • 

The  resulting  equations  are  in  fact  simplified  versions  of  those  of 
Ahrabi  (1978).  This  is  because  in  the  latter,  the  second  order 
derivatives  of  the  log  likelihood  have  a more  complex  representation 
and  also  that  for  the  Scoring  method  instead  of 
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- g 


d2log  l 


we  had  used 


(aios_£  . aiogj.) 

He1 J * 

which  proved  to  be  more  cumbersome. 

An  important  distinction  between  the  general  case  and  the  case 
of  the  pure  moving  average  model  is  that  the  log  likelihood  for  the 
latter  case,  as  derived  from  (3.1.8)  by  letting  P = 0,  is  the  exsct 
log  likelihood  of  the  data.  That  is  we  do  not  need  to  assume  that 
some  past  values  of  ^ are  fixed. 

The  model,  as  pointed  out  above,  is 

It  = ^t+^t-l  + ’**  + Mt-q  * 

The  parameters  to  be  estimated  are  the  components  of  6 as  defined  in 
Chapter  2,  with  = ufc.  And  obviously  in  this  case  we  only  need 
assumptions  1 and  3. 

The  Newton  Raphson  Method. 

We  get  the  seoond  order  derivative  of  log  l with  respect  to  £, 
by  letting  8 = ^ in  (3.3*10)  which  yields 
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where 


(3.4.2)  m = r*1®  r’\z'  r'1  + r'\i>  r"1©  r*1  - r'1©  r"1 


Now  let 


(3.4.3) 


= vec  Y'  = K^t  vec  Y = , 


where  Y was  introduced  in  (3.1.2).  We  also  recall  that 


(3.4.4) 


£ “ Em,T  ~ ~m,T  ' 


where  Z was  defined  by  (3.1.6).  From  (3.4.3)  we  get 


X £tn,T  2 ’ 


Substituting  this  and  £ as  in  (3*4.4)  in  (3.4.2)  we  get 


m - (5l,T®5i,T)(£'1®£'1K’  s'1  ♦s'1**  s'1®^1 


’ <L,raL,T>  S(5k,T®«i,i) 


say.  Now  if  we  let 


l 


(3.4.6) 


then  (3.4.1)  can  be  rewritten  as 


(3.4.7) 


d2log  l 


i gP*  NFG' 

d *>*  *+* 


We  note  that 


O-1*-8)  <^,1®%.,!)  ™»«u® i‘>  - vec[4i,T(5id® ■ vec(i ^e,hs)  ’ 


using  Lemmas  3.1  and  3.3.  It  is  now  clear  that  F is  what  wa s called  E 
and  £ is  what  was  called  £ in  Ahrabi  (1978).  The  first  order  deriva- 
tive, in  the  same  manner,  is  derived  from  (3.2.19)  which  yields 


2E,[£1s:®£"1it- £'1]  • 


which  is  identical  to  (3.2.11)  of  Ahrabi  (1978).  We  need  an  initial 
estimate  for  6 which  is  derived  from  (3.3. 5)  if  we  replace  u.  by 


. Uiat  is 


~(o) = ^ A Kt3^+s ' s " °A'  9 ’ 


So  finally  the  Newton-Raphson  equations  are 


2 S’  2(0 )££■  ' 2£’  tl7o )Z®  ■ ”<=  15)1 
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The  Scoring  Method. 


We  need  to  find  e • We  get  this  from  (5.3.15)  which 


yields 


e(f^r-)  = - I GF,(rf1©r-1)F,G  = - 1 z_1)f*g 


So  the  equations  for  this  method  axe 


A-l  . 

Tec  5(0)> 


3.5.  The  Problem  of  Computation. 

If  we  look  at  equation  (3.3.13)  and  (3.3.16),  which  are  the  equations 

for  the  Newton-Raphson  and  Scoring  methods  respectively,  we  realize  that 

to  eet  p \ we  have  to  tackle  two  computational  problems: 

~U) 

A A A 

(i)  The  problem  of  computing  n^,  A(0)>  6(0)’ 

(ii)  The  problem  of  solving  the  resulting  equations. 

The  second  problem  is  the  less  serious  one,  because  it  involves  solving 

O 

a set  of  r = (p-Hj)m  + m(m+l)/2  linear  equations.  Even  thougi  r can 
be  large  it  remains  fixed  as  the  number  of  observations  (T)  increases. 

By  comparison  in  the  first,  as  we  shall  see, the  matrices  that  are  to  be 
inverted  have  dimensions  of  order  T.  So  we  shall  concentrate  on  (i ). 

If  we  look  at  v3-3.12)  and  (3.3.15)  it  becomes  apparent  that  the 

A ^ 

major  computational  problem  for  computing  2(q)  an<*  4(o) 

A 

problem  of  inversion  of  8 and  £.  This  is  also  the  case  for  6(g) 
which  is  derived  from  (3.2.7)  and  (3.2.19). 
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Inversion  of  £. 


We  recall  that  as  in  (3.1.4)  S is  given  by 


» = t (S i®*1)  > 


i=0 


where 


L = 


0 o \ 


\ ~T-1  ~ / 


Now,  using  lemma  (3.3)  we  have 


(3.5.1) 


5n,T  ~ 5b,  T 


i ~ 

say.  We  notice  that  since  L is  lower  triangular,  so  is  S.  This 
makes  it  possible  to  find  S via  some  recursive  equations.  It  is 

clear  from  (3-5.1)  that  B has  l's  for  the  diagonal  elements.  This 
means 


(3.5.2)  I Si  = Isl  = 1 , 

which  was  used  in  deriving  (3.1.8).  We  also  notice  that  S is  block  Toeplitz. 
Lemma  3»5»  For  £ defined  by  (3-5.1) 

(3-5.3)  r1  - £ (LJ©B(o)), 

3=0 

where  B^° ^ ^ arid  \ j =1,  ...,  T-l  are  given  by  the  recursive 

equations 
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(3.5.4) 


r B.B^  = 0 , r = 1,2,  T-l 


i+j=r 


Proof.  Multiplying  ® by  the  right  hand  side  of  (3.5*3)  yields 


(3.5*5)  ®¥  (Ld®B(d))=  £ TE(Li+J®B^B(j)) 

~ j=0  i=0  j=0 


T-l  (,\ 

= E [£  ® E , 

r=0  i+j=r 


since  LT+h  = 0,  h = 0,1,  ....  Now  the  first  term  of  (3.3.5)  is 


i°®M(0)  * 5«t  - 


so  letting  the  left  hand  side  of  (3*3.5)  Be  equal  to  1^,  we  get 


£ B,B^  = Q , r = 1,2,  ...,  T-l  . 
i+j=r 


If  we  write  out  the  first  few  equations  it  becomes  clear  that  these 
can  easily  be  solved  recursively.  The  first  equation  is 


So8(1)  * feJ<0)  - o , 


which  yields 


(3.5-6) 


B^  = -B, 


The  second  equation  is 


2o£ 


(2) 


.(i) 


(o) 


B,  B + B~B  = 0 , 


which  yields 


= -B. B^1  ^ - B? 


= B2 

~± 


using  (3.5.6).  So  at  stage  n 


-B,B(n-1)-B,B(n-2) 


B . 


~n 


Notice  that  there  are  n terms  here,  but  we  know  that  for  n > p, 

B = 0.  This  means  that  for  any  n we  shall  at  most  have  p terms. 
~n  ~ 

That  is,  the  equations  (3.5.*+)  axe  recursive  of  order  p.  Hiis  makes 
the  computation  task  much  easier.  Q.E.D. 

From  (3.5.I)  we  get 

<3-5-7>  I'1  - - 

where  we  have  used  the  fact  that  T is  orthogonal.  From  (3-5.7) 
we  get 


which  together  with  (3.5*3)  and  lemma  (3*3)  yields  - 
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(3.5-8) 


-1 


T-l 

= £ (b(^C2>  Lj)  . 

0=0 


The  Problem  of  Inversion  of  r. 


We  recall  that 


O-S-S)  C ■ &,T  5 &,T  « 


where  2 is  defined  by  (3.1.6).  This  means 


So  the  problem  reduces  to  the  problem  of  inversion  of  2.  Now  from 
(3.1.6)  it  is  clear  that  2 is  a symmetric,  banded  block-Toeplitz 
matrix,  which  makes  it  easier  to  compute  2-1.  There  axe  efficient 
algorithms  for  inversion  of  symmetric  block-Toeplitz  matrices,  e.g. 
see  Friedlander,  Morf,  Kailath  and  Ljung  (1978).  The  idea  is  that  for 
an  N x N toeplitz  matrix  R the  inverse  can  be  represented  by 


~ [L.U, 
rN  ~1~1 


■!*&> 


where 
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0 


0 


Now  the  sequences  a and  b can  be  found  recursively.  This  has  also 
been  extended  to  the  block-toeplitz  case.  Now,  when  we  have  a banded 
toeplitz  matrix  then  there  are  simplifications  in  the  recursive  formulae 
(similar  to  the  simplifications  that  arose  in  solving  (3  - 5 • ^ )) and  this 
has  been  done,  in  the  scalar  case  by  Dickinson  (1978).  It  appears  that 
the  method  could  be  extended  to  the  case  of  symmetric  banded  block- 
toeplitz  matrices. 
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Additional  Computing  Hints 


A -1  A -1 

After  computing  Z(o)  8)1(1  ~(0)’  t*161,6  is  one  n>ore  tedious  computa- 

tion in  the  Scoring  method  and  that  is  the  computation  of 

(3.5.10)  r £(£)©£(£))£  • 

If  we  look  at  the  definition  of  F in  (3.2.1U)  and  (3.2.16)  we  notice 
that  to  compute  (3.5.10)  we  have  to  compute  terms  like 

(3.5.11)  (v.c(Kid©I,r)]’(r-J)®f-J))  Ls)  , 

j>u,v  = 1,  • ••,  = 0,1,  • ••,  <1  . 

And  also  terms  that  are  essentially  of  the  form  (3.5-H)  except  that 
one  or  both  of  the  L’s  might  be  replaced  by  L'.  Now  using  lemma 
(3.1),  (3.5.11)  can  be  rewritten  as 

[vec(ElJ®Lr)]'  ’ 

which,  using  lemma  l(i)  of  the  Appendix,  is  equal  to 

(3.5.12)  trIr-J)(Ei.©Lr)rjJ)(Evu®L's)]  . 


Now  let 
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* 


then  by  just  carrying  out  the  multiplications  in  (3.5-12)  we  see 
that  it  is  equal  to 

tr(ruiLrHvL' 8 ) . 

« MW  M 

Note.  Throughout  this  paper  we  have  assumed  that  = 0. 

However  in  practice  the  mean  of  ^ is  unknown  and  will  be  estimated 
by 

! 

- i f 

Then  the  estimating  equations  will  be  the  same  as  in  this  paper  except 
that  will  be  replaced  by  - £ * 


4.  Estimation  in  the  Frequency  Domain. 


4.1.  Preliminaries. 

For  a stationary  process  (z^,  t = 0,^1,  ...}  with  mean  zero  and 
covariances  £8  = s = 0,+l,  . . . , the  spectral  density  matrix 

f is  defined  by 


(4.1.1) 


Ifc>-5c  l 2s 


2n  l*  ~3 

S=-oo 


if  the  sun  converges.  The  covariances  can  be  recovered  from  f (• ) via 


2s  = / f(MeiSNl\  . 


The  sample  analogue  of  the  spectral  density,  the  periodogram,  is  defined 


(4.1.2) 


K\)  = ~ Y c e_isX  > 

2n  - (T-l ) ~S 


where 


C — ~ V z. z!  — C*  * s — 0.1.  ....  T-l  . 
~s  T ~wt+s  ~-s 


We  can  also  represent  l(\)  in  terms  of  the  discrete  Fourier  transforms 


(4.1.3) 


I(\)  = v(\)w*(\)  , 


where 


and  indicates  "conjugate  transpose".  For  a fuller  treatment  see 

Anderson  (1971 ),  Chapter  7» 

If  the  process  (z^.)  is  Gaussian  the  log  likelihood  is 
(4.1.5)  log  i = - | |D|  - | z'D^z-fi  log  2n  , 

where 

£ = •••>  5i )'  ’ 

d = e z z'  . 

^ (V 

Following  Whittle  (1953,1961)  and  Dunsmuir  and  Hannon  (1976),  we  will 
approximate  the  second  term  in  (4.1.5)  by 

-lltrtf’1^)!^)]  , 

t/ 


where  f(\)  and  l(\)  were  defined  by  (4.1.1)  and  (4.1.2)  and 


We  shall  show  the  relation  of  this  approximation  to  that  proposed  by 
Dunamuir  and  Hannan  in  the  Appendix. 


For  our  problem 

St  s Et  ' 2e  H Xs  • 

It  is  well  known  that  the  spectral  density  for  the  process  defined  by 
(2.1)  is  given  by 

(4.1.6)  f(\)  = i-  B'^e^A^V  A*(eiX)B*-1(eiX)  , 

where  B(* ),  A(* ) and  V were  introduced  in  Chapter  2.  (See  Hannan 

/V x 7 <v  w 

(1970)).  It  follows  that  the  spectral  density  of  the  moving  average 
part  is  given  by 

(ua.7)  Sk>  ■ h - if  £ J<s)«-Us  , 

-q. 

where  we  have  used  the  definition  given  in  (4.1.1).  Now  using  (4.1.6) 
and  (4.1.7)  we  get 

(4.1.8)  f(\)  = B'1(eiX)S(\)B*"1(eiX)  . 

Finally,  we  approximate  the  log  likelihood  by 


1 T_1 


where 


L 


It  * - It  *I<V 


As  in  the  time  domain,  we  shall  use  the  Newton -Raphson  and  Scoring 
methods  to  arrive  at  approximate  maximum  likelihood  estimates  that  are 
asymptotically  efficient. 


Note:  For  any  square  matrix  A we  have 


d log  |a| 

“357.  ' TaT  cof  aij  ' <1  )ji  • 


Using  this  we  get 


a log  |a| 


E (A"1)ji  (^}ij  = tr^"x  S3 r) 

J 


dA 


a a 


Using  this  on  (4.1.9)  we  get 


(A.i.io)  §£ 


bt. 


af. 


-I 


2 5oT“ 


.-l. 


+ 2 


it~£t 


4.2.  Hie  First  Derivative  of  A. 


Derivation  of 


dA 


S’ 


To  find  this  we  need  to  find  the  first  order  derivatives  of 
Now  (4.1.8)  can  be  written  as 
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(4.2.1) 


9 


~t  = 2;1  St  2t_1  ' t = 0,1>  •••'  T_1  * 


and  Bt  = ^(e^i),  ^ Differentiating  (4.2.1)  with  respect 

t°  we  get 


bt  + 
~ t 


-1  ^§t  -1.  *-i  -1  *_i  ^t  *-l 


(k'2'2)  ^7h7=“~t  ^Ihj ~t St2t~  -S9A-  ^57 St 


rs 


<3b 


rs 


3b 


rs 


iX.h  _ . -iX.h 

~t  -rs^  ~ *t=si*t  * ' 


= -B'^E  f , e t - f ,E_  B*-1e  * 


where  we  have  differentiated  (2.2)  to  get 


(4.2.3) 


^ r ^th 

5<CT'*»* 

rs 


Using  (4.2.2)  we  can  get  the  derivative  of  tr^1^)  in  the 
following  manner 


(4.2.4) 


3tr(f7xlJ  , bf,  , 

~t~t  = - Mr;1  r;1!.) 


bb 


Jh T 


rs 


v^t  ^b(h  ) it  it' 
rs 

i i i 

- tr<£t2tXA*  S it> 


iX.h  1 1 -iX.h  # * 1 

e trdt£l  & Srs)  + e trQ 


-iX.h  , iX.h  . . 

e £fe  jEfc\*s  + e ^£t^t  5t  ^sr  * 


TfS  = l,...,m,  h = l,...,p 
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Notice  that  in  the  above  expression  the  two  terms  are  conjugates.  To 
see  this  note  that 


i\,h  i i\.h  , , 

• <it£t  St  4r  ■ • (Jt£t  St  >; 


iX.  h iii  iX.h  ^ i 

- « (SISt' It  4.  ■ * <8t& Vrs 


-ix.h 

• <StS  It  4*  ' 


Now  using  (4.1.9)  and  (4.1.10)  we  have 


(4.2.5) 


i P s , i p s «*<£%> 


■ -I  E 


■It  £t 


From  (4.2.4) 


a tr  (f"1.!,  ) -iX.h  , 

l - -Ttr-  ~Ee  * <£*&*>„ 


t av 


iX.h  ■»  “IX. h i 

♦Ee  (itlk  >„  - 2 E e <St  £t  It 4.  - 


u - sr  “ 


since  the  sums  are  real  and  the  summands  are  conjugates.  The  sums  axe 

real  because  the  summands  sire  functions  of  e and  because 
iV  -iX_  . 

e z = e 1 . So  (4.2.5)  can  be  written  as 


r 


_lxth 


-iAh 

t "J-  1 


-^T--Ze  <5t  £t  it)r«  * £ • 'Or.- 


But  we  notice  that  the  second  term  on  the  right  hand  side  is  o(T). 
This  is  because 


? i ^“Or.  - h /;v-(b-)^  . o 


Since 


j-i\h(B*-1)  = g d e-iX(^) 

j=0  J 


-ix(d*) 


d\  = 0 for  h ^ 0 j = 0,1,.., 


Rote:  We  can  omit  terms  that  are  o(T)  because  of  the  forms  of  the 
Nevton-Raphson  and  Scoring  method  equations  and  the  fact  that 


I * 


So  we  finally  get 


3 A . r _JAt  , *-l  -1T  x 

35T  = “?e  ^^t 


-iA.h 


-E®  * vec(St1Btit)  ' h = 1, .. . ,p  . 

t 
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Putting  these  together  for  h = 1,  ...,  p we  get 


(4.2.6) 


~ t 


-l. 


where 


-i\.  -2iX  -pi\ 

(4.2.7)  gj.  5g'(\)  = (e  >e  , e )®I2. 

m 


Derivation  of 


* 


We  shall  need  the  derivatives  of  £t  which  follow  from  (4.1.7) 


and  (4.2.1) 


(4.2.8) 


a~t  -1  a~t  *-l  _ 1_  -1£  B*-l 
. (0)  St  (0)  St  ~ 2n  St  Ski  St  * 
a°ki  8a„ 


ki 


"n 


- - b (S'SbMk1)^ 

■ 2*  ^St  StitSt^t  ^ 

‘ 2n  ^ StitBtS  » 


since  the  matrix  in  brackets  is  Hermitian.  This  yields 


a trt&V 
_ »S> 


sr  > 


which  in  turn  yields 


(4.2.10) 


a£o 


- - h I ^S^sk1) 

t 

- - sr  ? ™=<s;1Btitg&;1> , 


since  the  sum  is  real.  Now,  using  (4.1.10) 


■=  - i — Jtptf”1!.)*!  — ytrtfT1!.) 

2 . ~2  *'t  2 as  r 


s2o 


l£t~£t 


Now 


(l‘-2-u)  is  | - - h ? «« & 

~0 


it  ~t 


-1 

t 


This  together  with  (4.2.10)  yields 


(4.2.12) 


t 


a£o 


We 


proceed  to  derive  — — • Using  (4.2.9)  we  have 


(4.2.13) 


a ‘HSV 


-‘V 


^tS  V *-l  -1 

w \t>  Xx*  -T 


IsT 


ki 


From  this  we  get 


. -ik,  s * 5 • 

- h e <2t  WtSA  >« 


u 

!Tsl 


So, 


Xt 


ke 


This  means 


i\.  s 


•v  ~ 

Se. 


which  finally  gives  us 


- -H*  t yec^\h^t ) » 


iX^s 


(4.2.14)  1 iE«'V  vec^JjijB&Ss;1)  , s ■ • 


Se 

~s 


using  (4.1.10).  Putting  (4.2.12)  and  (4.2.13)  together,  we  get 
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(k.2.n)  & - h l s it 

°R  t 


,-l. 


* -1  -1 . 
-1-  /*\  -1- 


Note.  Biere  is  an  alternative  form  for  (4.2.16)  -which  we  shall 
find  more  useful  in  deriving  the  second  derivatives  of  A.  To  derive 
this  alternative  form  we  note  that  (4.2.15)  can  be  rewritten  as 


(4.2.18) 


s tr<&V 


8a, 


GT 


i uts 
- — f e ** 
2n  16 


-iX.  s 

St  + e St  ^ > 


wl lere 


St.  " St  StitStSt 


and  we  have  used  the  fact  that  is  a Hermitian  matrix.  Now, 
from  (4.2.18)  we  get 
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s i , *v 


1 , ^t8  "iXtS  , , 

2ft  te  vec  £t  6 V6C  ®t] 


But  using  lemma  (3.2)  we  have 


vec  Si  = K m vec  S+  = 1L  vec  8+.  • 


8tr<£t‘.Jt>  1 , *V.  . -iV..  , 

Se -sr[e  l2*e  S«lvecfe' 


i 


which  gives  us 


|r  = cr?(e  t ~ 2 + e 1 ^ yec^it-^t1) 


iXL  s -iX,  s 


Using  this  and  (4.2. 12)  we  get 


(‘*•2.19)  ^ = 17  E St  ^(et-s;1) 


where 


, , I**  -qiXt 

(4.2.20)  Ht  = H (\t)  = (l,e  *,  . . . , e t)®I2+(l,e  \ ...,e  t)®I^ 

m 


Finally  for  ^ we  have  the  alternative  form 


(4.2.21) 


H = h I s St  yec^A1)  * 

~ t 


4.3.  Second  Order  Derivatives  of  A. 


a2A 


Derivation  of  . 

Using  lemma  (3-1)  we  can  rewrite  (4.2.6)  as 


(4.3.1) 


It'-EgtO^S^lvecSt 

u 


s-1- 


Differentiating  this  with  respect  to  P'  we  get 


(4.3-2) 


>2  . 3 vec  B. 

= - St  > 


Now,  from  (2.2) 


- I 


*V 


vec  St  = pi  e • 

^ j=0 


which  gives  us 


3 vec  B,  iX  j 

~~W.  - e I,o-l, 


.,p  . 


Or 


d vec  B.  iX,  PiX.  * 

(e  ....  e *)®I2  - St 

in 


Substituting  this  in  (4.3.2)  we  get 


(4.3*3 ) 


= ' ?6t(£t®S'L)ft 


lx  * 
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Derivation  of  • 


We  shall  first  find and  then  use 

S PS?’ 


(4.3.4) 


S A _ S A q( 

SPS0'  ~ SPS?'  ~ 


Now,  (4.2.6)  can  be  rewritten  as 


" - l Si 


Differentiating  with  respect  to  x we  get 


WTx  = - £ St(Jt2i®  ^ 


S vec 


- E 6t(ltSt®4n)  vec(£1  -Jr  St1* 


i St<ju;®  iX'1®  s;1) 

w 


, s vec  Q. 


By  letting  x be  the  components  of  £ we  get 


(4.3-5) 


agae'  '£  3j,  81 


Now,  as  in  (4.2.8)  and  (4.2.9) 


dcx. 


sSt 

Toy 


2«  Ski * 


ki 


sSt 


. -iX  s iX.  s 

ST [e  I • 


Vectorizing  these  results  in 


(4.3.6) 


8 vec 

Toy -sr  vec«w>  = 5rsw  - ■i’- 


bcr, 


(U.3-7) 


ki 


d vec  %.  -i^s  iX.s 


3 a. 


' ^u.  1 x/v.  » 

TTT  = S?!e  Swte  S«k!  • 


ki 


But  ^k  “ vec  Sgk  = 760  Sfcg  - using  lemma  (3.2).  So 


(4.3.8) 


8 vec  ^ 

JsJ~  =2Z[e 


da. 


ki 


-iX^s  iX.  s 

*2  + * ■ 

m 


Now  (4.3.6)  and  (4.3.8)  yield 


(M.9) 


8 vec 


St 


d2o 


2n^®ll,Sei»  •' 


e l-Il 
* Smm  2it  i 2 ’ 

m 


(4.3.10) 


d vec  g.  i IX.  s iX  s 

^r^<e  ‘v**  *■> 


Finally  putting  these  together  we  have 
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t 


(U.3.11) 


a wc  St 
a S' 


1 * 
— ^ H 

“ 2n  2t 


Substituting  this  in  (4.3*5)  we  have 


Substituting  this  in  (4.3.4)  yields 

<i*-3-i2)  m&  £ s*^-1®^  • 

<3P  o0  t 

w »v 


Derivation  of  . 

ae  ae' 

M M 

Differentiating  (4.2.21)  with  respe<  to  £’  we  get 


(4.3.13) 


a2A 


ae  ae' 


a vec  P, 


where 


St  - ^SdtSA1-^1  = ^ &1  • 


So  we  need  to  find  the  first  order  derivatives  of  P^.  . Differentiating 
p with  respect  to  x we  get 


-1  -1  ^S  _i 

St  -St  55 St 


Vectorizing  both  sides  yields 
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I 


^ VCC  P I 1 1 ^ Sa- 

{k.i.ik)  — ■ -i  (j;®  s;  » vec-5r 


= - J3t  vec  sr  » say 


Now,  for  x = ar^  we  get 


S vec  P 
~t 


% vec(hha) 


" 2 n > 


which  gives  us 


(k.3.15) 


d vec  P, 
~t 


-h  «t 


Similarly  for  x = we  get 


S vec  P 
Mi 


-i\  s iVs 

~ 2n  %t(e  J 2 + 6 Mn^k0 
in 


(fc.3.1 6) 


d vec  P. 
~t 


-iX.s  i\  s 

- 7— * M.  (e  ■*-?  + e K),s=l,...,q 

£7T  ~ HU 

m 


Putting  (k.3.15)  and  (b.3.16)  together  we  get 


5 vec  P, 
~t 


-srStBj  • 


(4.3.17) 


3 vec  P, 


be' 


-*  - gratis’  • 


Substituting  this  in  (4.3.13)  we  get 


2 

(M.18)  -hi  SSj^hJo’  . 

3e3e'  8*  t ^ 

This  completes  the  derivation  of  the  second  order  derivatives  of  A. 


U.4.  The  Newton-Raphson  Method. 

This  method  consists  of  solving  the  following  system  of  linear 
equations  for  P^  ^ 


(4.4.1) 


3 A 


3p  3p' 


/ A a \ 3A 

«(o,  (£(1,'£(0,) 


5*£(o) 


We  get  the  matrix  of  second  order  derivatives  of  A , evaluated  at 
p = £(0y  from  (4.3.3),  (4.3.12)  and  (4.3.18) 


(4.4.2) 


32A 

3p  3p' 


£=£(0) 


/ E §t^t®  ?t(°)^t  " 2*  £ St^t2t(o)St(o)®  St  &tS 
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from  (4.2.6) 


We  get  the  first  derivative  of  A,  evaluated  at  £ = £,(o)* 
and  (4.2.17) 


(4.4.3) 


dA 

3p! 


I St 


£ £(o) 


V GJ.  vec  P 

2rr  £ ~ 


(0) 


We  substitute  these  in  (4.4.1)  and  solve  for  p^  . 


4.3.  The  Scoring  Method. 

This  method  consists  of  solving  the  following  system  of  linear 
equations  for  ^ . 


(4.5.1) 


SA  I 

^£i£=£(  o)  * 


Now,  we  have  seen  in  section  (4.3 ) that  > 

observations  only  through  t =0,  ...,  T-l. 

of  the  Appendix  allows  us  to  replace  by  f^. 

expectations  of  (4.3*3 )>  (4.3.12)  and  (4.3.18). 


depends  on  the 
Lemma  (6) 
when  taking 
So  we  get 


(4.5.2) 


= -i  - 

t 
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0>.5.3)  ' e(5T  ^6t(it5tSt 1 ® )5tS"  ’ 


-h  EStlltStSt’1® S^&tS’ 


k £8 t<5t  ®S  >St£' 

W 


Finally 


e(^l^r)  = ■ 72  e(£  sStMtS } 

•**  *>*  OTt  *C 


si'1®  it  * si'1®  s;1  - 


tfiere 


which  means 


~t  = 2t£t£tSt  " ~t 


e h 1 ^StitSis;1  - s;1  - 0 < 


ft  m.  = q! -1®  0T1 


Hence 


(^5.m  e(^^7)  “ ' jjj?  £s~t(St 1<s>  St^StS 


/■%  ” 1 \rr  n 


So  we  evaluate  these  expressions  at  £ = £^0)  an^  then  substitute  them 
in  (U-.  5.1)  and  solve  the  resulting  system  of  linear  equations  for  £qj. 


4.6.  The  Pure  Moving  Average  Case. 

In  this  special  case  we  have 

2t  = * £t  s St 


which  means 


St  ~t  £t£t  * 


2t  * • 


Using  these  and  (4.3.18)  we  can  get  the  Newton-Raphson  equations 


- 1.  II  25^(0)352' l(s(1,4(0))  z 2itvec  2t<0)  . 


This  is  identical  to  the  equation  derived  by  Ahrabi  (1978).  Similarly 
the  equation  for  the  scoring  method  can  be  derived  using  (4.5.4) 


tlgSt  (^(o)®!‘)o)>!&,!(i(i)-I(o))  - arl22><=  !t(0)  > 


which  is  identical  to  the  equation  derived  by  Ahrabi  (1978). 
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5.  Asymptotic  Properties. 


Hie  four  estimates  proposed  in  the  preceding  chapters  are  asymptoti- 
cally equivalent  and  we  shall  show  that  they  are  asymptotically  efficient 


i.e. , 


VT  (p(1)-p)  ^n(o,  £_1(p))  , 

where  i3(g)  is  the  limiting  average  information  matrix  and  " 
indicates  convergence  in  distribution. 

To  find  a(g),  by  definition  we  have 


$(p)  = lim 

i e(£ losi, 

T -*oo 

Sp3p' 

= lim 

T -••oo 

1 dpdP1 

»v 

Now  let 

[ cp  £}'  \ 

(5.1) 

*(p)  = 

~ ~ 1 . 

12  t / 

Then  from  (4.5.2),  (4.5.5)  and  (4.5.4)  we  have 


(5.2)  z-  llm  i 

T 00  t 

- g“  f 2n»(f,®Q"1)R*d\  , 
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t 


2*  -f  <sr  l ®St  >St£> 

T 00  t 


i r2n  i i * 

^ / ft  (5  © Q"  )H  G*  d\  . 

(2")  J0 


(5.U) 


t-11”  f <r?  I sst(Si*i®sr'1Bt£') 

T “♦  oo  ofl  t 


-\*  /Ov  /-\ ”1  \rr*/- 


'd?  /"  22<S''1®S'Vg,d»- 


The  four  estimates  are  obtained  from  equations  like 


(5.5) 


f /A  o 1 - - ^lQS  1 
~(0)  ~(1)  '"(0)  ~ T 3p  . 

~ ~ ^(0  ) 


where  t(o)  is  30  estimate  of  g(p)  and  is  consistent.  We 

can  write  (5*5)  as 


(5-6)  l< o,V5  <£(i)*£)  ■ 1(0)  (£(0)-fi)  + J i4rf1|  H(0)  ’ 


where  P is  the  true  parameter  value.  Now 


(5.7,  A liS£j  = 4 iiSS-L  +4  liSSj.  (p.„*  ), 

7t  3 o /t  3 p P=P/n\  /t  SpSp’  p=p  ^ ' 

~ ~ ~ ~(0  ) ~ ~ ~ ~ 


where  |p-P+|  < | P-P fn  ) I * Now  (5*6)  can  be  rewritten  using  (5*7) 


(5.8)  ^(0)VT  (P(1)-£)  - tS(o)  + T 


1 c*  log  1 1 


5pS0'|p=p 


, a v . i a log  a 

x 7t  (P/rt«-p)  + — — 

~(o)  ~ Vt 


Now  noticing  that 


i 3 log  a 
T ^ 

a p a p ' p=p 


- -o(p) 


and  that  Vt  (pfor£)  is  bounded  in  probability,  we  see  that  (5*8)  is 


(asymptotically ) equivalent  to 


(5-9) 


* «u>*>  - % "ff 


Theorem.  If  in  addition  to  Assumptions  1-5  of  Chapter  2 we  assume 

that  the  e 's  have  finite  fourth  order  moments,  then 
~t  


(5,10)  Vt  (£/,  \-p)  ^ N(2>2  1^2,^  * 


vjhere  p,  s is  any  one  of  the  four  estimates  proposed  in  the  previous 
~(1)  “ 

chapters. 

Proof.  Using  (5-9)  it  suffices  to  show  that 


(5.11) 


-i  ^ logj  ^ N(0,-S(p) ) 

Vt  a e 


(5.12) 


A A A . 

where  is  an  initial  estimate  of  ^ and  tf^^)  an  initial 

estimate  of  the  limiting  average  information  matrix  of  . Now 
applying  the  same  argument  as  we  did  for  p we  see  that  (5.16)  is 
equivalent  to 

(5.17)  Vt  <i(1)-i) 

Now  (5-15)  and  (5*17)  imply 


(5.18)  -1  ^ log  l jvN(o,.)  . 


Finally  from  (5. 14),  using  (5.18),  we  get 


(5.19)  ~ • 


The  desired  result  is  obtained  from  (5.9),  which  together  with  (5-19) 
gives  us 


(5-20)  (£(1)-£)  -»»(0,j'1(e))  • 

v ' Q.E.D. 


Note.  The  approximation  to  the  log  likelihood  used  by  Nicholls 
(1976)  is  not  identical  to  ours,  i.e.,  (4.1.9).  But  as  we  shall 
demonstrate  in  the  Appendix  it  is  asymptotically  equivalent  to  it.  This 
means  that  if  Nicholls  had  used  (4.1.9)  he  would  still  have  obtained 
asymptotically  efficient  estimates,  as  we  assumed  in  the  proof  above. 
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6.  Estimation  of  the  Coefficients  and  the  Covariance  Matrix 


of  the  e.  's  in  the  Time  Domain  (The  Scoring  Method). 
~t  


For  the  model  defined  by  (2.1),  Rexnsel (1976 ) gives  equations  for 
the  estimates  of  A.,...,A  , B.,...,B  , V,  using  Newton-Raphson 

~_L  ~(J  y ** 

method  on  the  (modified)  log  likelihood  of  the  data.  In  this  chapter 
we  shall  use  the  techniques  developed  in  the  preceding  chapters  to 
arrive  at  the  equations  for  the  estimatesof  these  parameters  using  the 
scoring  method. 


The  Likelihood  Function. 


Assuming  that  = e ..  =••■=£,  =0,  and  using  the  same  method 

~-_L  -q  ~ 

as  in  section  3.1  we  have 


(6.1) 


®y  = a e , 


where 


(6.2) 


«-  it*!®!1)’ 


i=0 


But 


~ iT®  V)  • 

So 

(6.5)  e<!£‘>-  5.,T<5r©X^,T  * i®ir  ' i ’ 

say,  where  we  have  used  lemma  3.3.  Using  (6.1)  and  (6.5)  we  have 

y ~ N(0,  ®_1a  i a'®'"1)  • 

So  finally  the  modified  log  likelihood  is 

(6.6)  log  ity  = - f log  2*  - | iog|»-1ajga,»,~1l 

- |y,®’s,‘W1Sx  • 

We  can  further  simplify  this  by  noticing  that  as  in  (3.5*2) 

1*1  = l«l  = 1 ’ 

and 

IJI  ■ |V®  Jjl  - |V|T  . 

So  (6.6)  becomes 


(6.7)  log  log  2n  ' I log  lYl  - | JC'®'S"V:L2,";i®)(* 

The  Estimation  Method. 

The  parameters  to  be  estimated  are 


Zi  = vec  , i = l,...,q  , 

= vec  Bj  , j = l,...,p  , 

V . 

We  let 


(6.8) 


a 


k 


where  a'  . (a* , . . . , a*  ),  P'  O',  . ..,  P' ).  We  are  going  to  apply 
the  scoring  method  to  arrive  at  approximate  maximum  likelihood  estimates 
that  are  asymptotically  efficient.  It  is  well  known  (See  Dunsmuir  and 
Hannan  (1576).  ) that 


(6.9) 


lim 

T -•  00 


1 

T 


This  means  that  the  limiting  average  information  matrix  is  block  diagonal. 
So  we  can  write  separate  equations  for  estimates  of  and  V without 
violating  asymptotic  efficiency  of  the  estimates.  We  shall  first  derive 
the  quations  for  q ^ y the  estimate  of  ^ by  the  scoring  method.  These 
equations  are 


(6.10) 


~ ~ n /„  \ 


l^(o ) 
2-2(0) 


3 log  l | 

= “5T  M(0)  ' 


2-2(o) 


where  n , and  V,~x  are  initial  estimates  of  r)  and  V that  are 

«>  (0 ) ~ (0 ; * 

consistent  of  order  T-1^2,  as  given  by  Reinsel  (1 976).  We  proceed 
to  find  the  first  and  second  order  derivatives  of  log  l with  respect 


to  a. 


The  First  Order  Derivatives. 
Derivation  of  ^ ^ . 


(h) 


Differentiating  (6.7)  with  respect  to  we  get 


^-1PA-  = y'  ®'  s'  '1j‘1a'1  (e.  . ® Lh  )e“1sx 
—(h)  *.  ~ ~ ~ ~ ~iJ  ~ ~ 

aaio 


= vec(£ij®i;h)  » 


using  lemma  3.1.  Using  the  same  method  as  used  in  deriving  (3.2.5) 
we  get 


(6.11) 


where 
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A 


(6.12)  1 - (Ex,  Eq)  , 
and  E was  defined  in  (3.2.4). 

***JT 

Derivation  of  ^ ^ . 

This  was  derived  in  Chapter  3 and  is  given  by  (3.2.7),  which 
states 

(6.13)  ~~t°Pg  ~ = -E‘  (y®r-1B^)  . 

Now  we  need  to  express  r-1  in  terms  of  the  parameters  in  this  chapter. 
We  recall  that 

(6.14 ) T = C(uu'  ) , 

IV  (V  IV 

where 

2 = »!£> 

as  given  by  (3.1.4).  Now  using  (6.1),  (6.4),  (6.14)  we  have 


r = e(ae  e'a* ) = eja’  . 


i 


Hie  Second  Order  Derivatives. 


L 


Derivation  of  8 . 


.(h) 


Differentiating  (6.11)  with  respect  to  a^  we  get 


(6.16) 


82log  i 
~ ij 


■I'  [g"1(Eij  © Lh  )a_1®  jr  © a'  ■1j2'1a"1®jr 


+ ‘1(E^©  L*h  )a'  "1J’1a*1sx. + «'  "11S"1a'1 


Now,  using  lemmas  3*4,  (6.l6)  can  be  rewritten  as 


(6.17) 


d2log  i 

' (h) 


6a  S a. 


-E'  vec[a' "1J"1a"1»y  y'S’a' "1(E!  .©Lh)a''1 

/w  <v  m Iv  fv  rv  ^0.  J «v 


ij 


+ a,-1(El ,®  L,h)8,*V1a'1syy,8'a'*1  + a'  "13"1a-1 


,-l 


(£ij®i  )S  UZ  ® S ]< 


Taking  expectations  of  both  sides  of  (6.17)  and  using  (6.5)  we  have 


(6.18) 


e(""g(hV  =-2£’  vec[S,'1(Eij®i:'h)2,‘1 

8a  8 a. . 

~ ij 


-E'  vec[a'  ■1i“1a“1(E.  . © Lh  )J  ] 


As  we  shall  now  show  in  the  Appendix  (lemma  3 ) the  first  term  on  the 
right  hand  side  of  (6.18)  is  equal  to  zero.  Using  this  and  lemma  3.1, 
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(6.18)  can  be  rewritten  as 


£ ( d lo^A  = -E'  (&®  fl  h 1a  1)  vec  E,  © Lh)  . 

dcTaapP ~ij  ~ 

~ ij 


From  this  we  finally  get 

(6-19)  - -E'(|3©«"14'Y1)I  • 


Derivation  of 

Differentiating  (6.11)  with  respect  to  b^j^  we  get 

(6.20 ) = I' t?'1  %.s  ® Lh  )jr<2>  a'  'VV1^ 

~ 1.1 

+ S-1®*®**’ " W1  C£ij®  £h W 

= s’  ^[rVY1^^ (e^®  L'^a'-1 


using  lemma  3.4.  Taking  expectations  we  have 


(6.21) 


etf1”6  fh))  -I'  ve°tS,'1(Eij® 

S~9bij  J 

+ E*  vec[ a'  -1J”1a"1  (E.  .®  Lh)»_1aJ]  . 

~ ~ ~ ~ v~ij  ~ ~ ~>v 
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r 

■Rie  first  term  on  the  right  hand  side  of  (6.21)  is  equal  to  zero.  This 
follows  from  lemma  2 of  the  Appendix.  Now  using  lemma  3.1,  (6.21)  can 
be  rewritten  as 

s ( d lo^lj-)  = E'  («j'  a' »'  _1  © fl'  "1y1g'1 ) vec  (EtJ  ® Lh ) • 

From  this  we  finally  get 

(6.22)  e(|)f| £>■  !'(4,«,»'-1®S"W1>S  • 

! i? 

Derivation  of  S (^^^,^  • 

<V 

This  has  al ready  been  derived  in  (3.3.14)  and  (3.3.15)  'which  together 
yield 

e (fjrfM-  > * -S'  <S_1t  S' _1®  £_1  « • 

But 

r = a J a1  . 

M ~ Sr 

So 

(6.23)  ^(Sn&r-)  = -e*  (s^a* a' s'"1®  a,"VV1)E  • 

Putting  (6.19),  (6.22),  and  (6.23)  together  we  get 
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I 


r 


<6-21,>  -«<ip£r->  ■ 


®_1a  a 


rf'aV1 


-®,"1aja'®,-;L 


(E  ,E)  = W,  say  . 

(V  (W 


<2>  e"W1 


The  Scoring  Method  Equation  for  tj^j. 

Substituting  the  expressions  derived  for  the  first  and  second  order 
derivatives  of  log  i in  (6.10),  we  get  the  desired  equation,  which  is 


(6.25) 


2(o)(5(i)-l(o))  = 2(0) 


where 


(6.26) 


w = \ lpg  1 
~(o ) ^ 


1^(0) 

M(o) 


and  ^ lo6  ^ is  given  by  (6.11)  and  (6.13). 

Estimation  of  V. 

— — ■ ■■■■■■■ 

A A A 

Once  we  have  we  can  replace  a and  ® with  a^, 

in  (6.7)  and  maximize  the  resulting  function,  which  we  denote  by 
log  7 . So  we  will  maximize 


Now  the  last  term  can  be  rewritten  as 


(6.28)  " 2 X 2(i)2(i)£m,T 


, T 

-I  E! 

* t=i 


td)~  ~td) 


This  follows  from  (6.1)  and  (6.4).  Now  it  is  well  known  that  the  value 
of  V which  maximizes  (6.27)  is  given  by 

/v 


(6.29) 


(r  1 r-<  C At 


See  Anderson  (1958),  Chapter  5.  We  can  express  in  terms  of 

®,  .,  y using  the  same  argument  as  in  (6.28).  So 
~(1)  ~ 


(6.30) 


~(D 


?£,t*U)S(i)W  SftKKi 


We  could  theoretically  carry  out  further  iterations,  but  this  would 
be  computationaly  costly.  The  estimates  given  above  are  asymptotically 
equivalent  to  the  estimates  derived  via  Newton -Raph son  method  and 
hence  are  asymptotically  efficient  as  demonstrated  by  Reinsel  (1976  )• 
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We  shall  now  derive  some  of  the  results  that  we  have  used  in  the 


previous  chapters. 

The  Time  Domain. 
Lemma  1.  (i ) For 


two  matrices  A and  B 
rx  s s x r 


tr(AB)  = (vec  A)'  vec  B'  . 

~ r*  ' /-w  /<w 


(ii)  For  square  matrices  A and  B 


tr(A©  B)  = tr(A)  • tr(B)  . 


Proof.  (1)  and  (2)  are  easily  verified  by  •■’riting  out  the 


two  sides. 


Lemma  2.  For  S,  8,  E.__,  E_._,  and  L as  defined  in  the  time  domain 
■ ■ - - ■ ew  ~ ~rs  — — ■ '■■■  ■■  ■ - — ■ ■ — 


(3)  tr[e-1(E  ®Lh)8_1(E  (2)Lk)]  =0  , 

~ ~oiv  ~ ^ v~r3  ~ /J  7 


for  positive  integers  h,k. 

Proof.  From  (3. 5-3)  we  have 


8'1  = Y(£(i)(E>Li) 


Similarly 


2"1  - L (A(j)©LJ) 
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So  the  left  hand  side  of  (3 ) can  be  rewritten  as 


(4) 


T-l  T-l 

r 7 tr(AU;E  BU\©  LJ 

Li  La  ' **  ~rs 

j=0  i=0 


. ( j )tt.  T j+h+i+k 


Now,  Lemma  l(ii)  applied  to  the  summand  in  (4)  yields 


(5) 


tr(A(d^E  B(i)E  ) • tr(LJ+il+1+k)  = 0 , 


since  i+j-di+k  is  a positive  integer  and 


tr(L  ) - 0 , r - 1,2,  ...  . 


This  means  that  (4 ) is  identically  zero  which  proves  the  lemma. 


Lemma  3. 


(6)  E'  vec[fl' 

Proof.  The  left  hand 
element  of  which  is 

(7)  vec(E  © Lk)  vec[a  _1  (E,  . © L h )$  . 

This  follows  from  the  definition  of  E as  given  by  (6.12).  Now 
applying  Lemma  1 (i ) and  Lemma  2 to  (7 ) we  show  that  it  is  identically 
zero. 

Q.  E.  D. 


side  of  (6)  is  a column  vector,  a typical 
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Note:  It  is  obvious  that  by  the  same  reasoning  as  in  Lemma  5 


(8)  E*  vec[fl' "1(Ei^.©  L'h)»’ "1]  = 0 , 

which  was  used  in  (6.l8). 

The  Frequency  Domain. 

Lemma  4.  For  A(*  ) and  B(* ) as  in  Chapter  2 

r 2n  r 2n 

(9)  log  | A A*  | d\  = I log  | B B | d\  = 0 , 

J0  J0 


where  we  have  omitted  the  argument  e^\ 

Proof.  We  shall  prove  this  Lemma  for  A and  the  argument  for  B 
is  identical.  We  shall  show  that 


Then  since  for  a = 0 
r 


the  desired  result  will  follow.  To  derive  the  left  hand  side  of 
LO ) we  have 


log|AA*|  - tr(A*_1A  e'^A* ) + tr(A*  ±A  1AE  e"inA) 


*-1.-1. » -ih\ 


da. 


j ''o,»wiv  ' iv  ~rs  ^ I.  iv^gr 


rs 


, -1_  i\h,  /.*-l_  -i\h N 
= tr(A  j£rse  ) + tr(A  E^e  ) 


~sr 


iXh,  -lx  A -lXh  , *-l  \ 

= e (A  ) + e (A  ) 

st  ~ rs 


From  this  we  get 


^ loslAA  I i\h  t -1  -iXh/"1 

= e A + e A 


So  finally 


5£ 


- f 2”  logl  AA*I  d\  - f2”  elU,A’-V  * f 

Jn  Jn  Jc, 


2n 


e-i)OiA*-ldx  = Q ^ 


since  a’  1 is  a power  series  in  e^  and  A*  1 a power  series  in 


-iX  . 
e and 


f2n  ik\  /'2n  -ik\ 

| e d\  = / e d\  = 0 

'0  ^0 


0 


Q.E.D. 


Lemma 


_5.  For  f^,  V as  in  Chapter  4. 


T-l 


(11)  | £ log I £t  I -*  -m  log  2 « + log|v| 


t--0 


Proof.  The  left  hand  side,  as  T -*  oo,  tends  to 


* ^ 
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(12) 


i r2n 

h J0  1°gl£(>oldx-  • 


But 


|f(\)|  = (2n)~m|B'1AVA*B*"1| 

' •>*  >VlV  M />/ 


(2n)^|v||BB*r1|AA*|  , 


which  yields 


log|f(\)|  = -m  log  2n+log|v|  -log|jBB*|  + log|AA*| 


So  (12)  becomes 


i r2n 

-m  log  2«  + loe!  v|  + ^ (log I A A* | - logl  B B*|  ) d\ 
= -m  log  2it  + log  | V|  , 


using  Lemma  4. 

Q..E.D. 

Note.  It  follows  from  Lemma  5 that  the  modified  log  likelihood 
used  in  Chapter  4,  given  by  (4.1.9)  is  asymptotically  equivalent  to 
the  one  used  by  Dunsmuir  and  Hannan  (1976)  and  Nlcholls  (1976).  One 
consequence  of  this  is  that  maximizing  (4.1.9)  with  respect  to 
A^,  . ..,  Aq,  B^,  ...,  jBp,  V leads  to  asymptotically  efficient  estimates 
for  these  parameters.  We  have  used  this  in  the  proof  of  the  theorem  in 
Chapter  5. 
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Lemma  6.  For  1^.  and  as  in  Chapter  4 


(13) 


where 


lie  lt  ‘ - gT  ' = °'1>  •’  > T"1  > 


|( A||  = tr  AA*, 


for  any  matrix  A and 


gT  = o(l)  . 


Proof.  Using  (4.1.2)  we  get 


c 2n  , , T, 


-is\. 


“t  2jt  I | t*  Is  ^ 
s < T-l 


So 


(14 ) e T . f , . 1 £ 

~t  ~t  2jt  i | 


s > T-l 


Xs  e 


-is\.  . -is\, 

* -dt . .E_  1 


Is  <T-1 


Using  triangle  inequality  on  (14  ) yields 


<15)  ,E  IWsir,  ,E  llsv.ll  - Rj . 

I s I > T-l  I s < T-l  1 


say.  Now  the  first  term  of  g?  is  0(1).  This  follows  from 
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— V v — f , 

2ji  | | “ ~s  ^0  , 

s <T 


which  means 


Y,  llVgll  -♦  0 as  T ♦ oo  , 


s > T 


The  second  term  of  gT  is  also  o(l)  because 


(16) 


= — E sv 

oX  \=0  2jt  “ ~s 

s = —GO 


-is\ 

e , 


this  follows  because  f is  a rational  function  of  e^\  Now  (l6) 
implies  that 


, E II3VJI  < l lisvsll  < . 

S <T-1  S = -<x> 


Q.  E.  D. 


Note.  Lemma  6 says  that 


lim  S I.  = f . , 
~t  ~t 

1 ■*  00 


and  the  convergence  is  uniform  in  t.  This  enabled  us  to  derive  a 
suitable  approximation  to 

w-iL4 ) 

' oP  dp' 

in  section  4.5. 
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